# High-fidelity Audio
Csm 1b
Apache-2.0
CSM (Conversational Speech Model) is a 1B-parameter speech generation model developed by Sesame, capable of generating RVQ audio encoding from text and audio inputs.
Speech Synthesis English
C
unsloth
2,667
5
Csm 1b
Apache-2.0
A PyTorch-based text-to-speech model supporting Chinese speech synthesis, developed and released by SesameAILabs.
Speech Synthesis
C
nielsr
18
3
Sepformer Dns4 16k Enhancement
Apache-2.0
This is a speech enhancement model based on the SepFormer architecture, specifically designed for denoising tasks. It was trained on the Microsoft DNS-4 dataset and supports audio processing at a 16kHz sampling rate.
Audio Enhancement Supports Multiple Languages
S
speechbrain
1,669
20
Featured Recommended AI Models